Estimating the mean and variance from the median, range, and the size of a sample
Abstract
BACKGROUND Researchers performing meta-analyses of continuous outcomes from clinical trials usually need the mean and the variance (or standard deviation) of each outcome in order to pool data. However, published reports of clinical trials sometimes give only the median, the range, and the size of the trial.
METHODS In this article we use simple, elementary inequalities and approximations to estimate the mean and the variance for such trials. Our estimation is distribution-free, i.e., it makes no assumption about the distribution of the underlying data.
RESULTS We found two simple formulas that estimate the mean using the median (m), the low and high ends of the range (a and b, respectively), and the sample size (n). Using simulations, we show that the median can be used to estimate the mean when the sample size is larger than 25. For smaller samples, the new formula devised in this paper should be used. We also estimated the variance of an unknown sample using the median, the low and high ends of the range, and the sample size. Our estimate performs best in our simulations for very small samples (n ≤ 15). For moderately sized samples (15 < n ≤ 70), our simulations show that the formula range/4 is the best estimator for the standard deviation (variance). For large samples (n > 70), the formula range/6 gives the best estimator for the standard deviation (variance). We also include an illustrative example of the potential value of our method using reports from the Cochrane review on the role of erythropoietin in anemia due to malignancy.
CONCLUSION Using these formulas, we hope to help meta-analysts use clinical trials in their analysis even when not all of the information is available and/or reported.
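The sample-size rules in the abstract can be sketched in a few lines of Python. The abstract does not state the formulas themselves, so the small-sample expressions below are an assumption: they are the forms commonly quoted from the full paper, mean ≈ (a + 2m + b)/4 and variance ≈ [(a − 2m + b)²/4 + (b − a)²]/12. The function names `estimate_mean` and `estimate_sd` are hypothetical, chosen only for this illustration.

```python
import math


def estimate_mean(a, m, b, n):
    """Estimate the mean from the minimum a, median m, maximum b and size n."""
    if n > 25:
        # Per the abstract: for n > 25 the median itself is an adequate estimate.
        return float(m)
    # Assumed small-sample formula (not given in the abstract).
    return (a + 2 * m + b) / 4


def estimate_sd(a, m, b, n):
    """Estimate the standard deviation using the abstract's sample-size cut-offs."""
    if n <= 15:
        # Assumed small-sample variance formula (not given in the abstract).
        variance = ((a - 2 * m + b) ** 2 / 4 + (b - a) ** 2) / 12
        return math.sqrt(variance)
    if n <= 70:
        # Moderately sized samples: range / 4.
        return (b - a) / 4
    # Large samples: range / 6.
    return (b - a) / 6


if __name__ == "__main__":
    # Made-up trial summary: min 2, median 5, max 12, reported at three sizes.
    for n in (10, 50, 100):
        print(n, estimate_mean(2, 5, 12, n), round(estimate_sd(2, 5, 12, n), 3))
```

The cut-offs (25 for the mean; 15 and 70 for the standard deviation) are taken directly from the abstract. Anyone applying the sketch to real data should check the small-sample expressions against the full text first.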
Related articles
Estimating Variance of the Sample Mean in Two-phase Sampling with Unit Non-response Effect
In sample surveys, we always deal with two types of error: sampling error and non-sampling error. One of the most common non-sampling errors is non-response. This error occurs when some sample units are not observed, or are observed but do not answer some of the questions. This error cannot be prevented completely, but it can be reduced significantly. The non-response causes bias and ...
Size specific dose estimate (SSDE) for estimating patient dose from CT used in myocardial perfusion SPECT/CT
Objective(s): Size specific dose estimate (SSDE) is a new parameter that includes patient size factor in its calculation. Recent studies have produced mixed results on the utility of SSDE, especially when automatic exposure control (AEC) was used. The objective of the study was to find out if there is a relationship between patient size and each of the parameters, SSDE...
Estimation of Variance of Normal Distribution using Ranked Set Sampling
Introduction: In some biological, environmental or ecological studies, there are situations in which obtaining exact measurements of sample units is much harder than ranking them in a set of small size without referring to their precise values. In these situations, ranked set sampling (RSS), proposed by McIntyre (1952), can be regarded as an alternative to the usual simple random sampling ...
Estimating a Bounded Normal Mean Relative to Squared Error Loss Function
Consider a random sample from a normal distribution with unknown mean and known variance. The usual estimator of the mean, i.e., the sample mean, is the maximum likelihood estimator, which under the squared error loss function is a minimax and admissible estimator. In many practical situations, the mean is known in advance to lie in an interval. In this case, the maximum likelihood estimator...
Stratified Median Ranked Set Sampling: Optimum and Proportional Allocations
In this paper, for the Stratified Median Ranked Set Sampling (SMRSS), proposed by Ibrahim et al. (2010), we examine the proportional and optimum sample allocations that are two well-known methods for sample allocation in stratified sampling. We show that the variances of the mean estimators of a symmetric population in SMRSS using optimum and proportional allocations to strata are smaller than ...
Journal: BMC Medical Research Methodology
Volume: 5
Publication date: 2005